API: add float128 and float32(64,128)_complex dt by samnordmann · Pull Request #492 · openucx/ucc

samnordmann · 2022-04-27T08:20:24Z

What

Add support for four new datatypes: float128, float32_complex, float64_complex, float128_complex.

float32_complex and float64_complex are supported on HOST and CUDA. float128 and float128_complex are supported only on HOST.
The new datatypes are supported by all collectives. float128 supported with the reduction ops: min, max, sum, prod, avg. float32(64,128)_complex supported with the reduction ops: sum, prod, avg.
Update MPI_test and gtest with the new datatypes. Remark: the gtests are run with float128 and float128 only on mc_reduce tests.
new datatypes not supported by nccl nor sharp TLs

manjugv · 2022-04-27T14:30:24Z

@samnordmann It might be better to decouple the formatting changes from the functional changes.

src/components/mc/cpu/reduce/mc_cpu_reduce.h

src/components/mc/cpu/reduce/mc_cpu_reduce_double_complex.c

src/components/mc/cuda/kernel/mc_cuda_reduce_multi_alpha.cu

src/ucc/api/ucc.h

test/mpi/buffer.cc

test/mpi/main.cc

jladd-mlnx

Way too many white-space changes. I had to hide whitespaces in order to make sense of the PR.
The file src/ucc/api/ucc.h is Whitespace-only changes.
Should we include support for complex half precision? Does HPL AI use it?

typedef enum cudaDataType_t
{
        CUDA_R_16F= 2, // 16 bit real 
        CUDA_C_16F= 6, // 16 bit complex
        CUDA_R_32F= 0, // 32 bit real
        CUDA_C_32F= 4, // 32 bit complex
        CUDA_R_64F= 1, // 64 bit real
        CUDA_C_64F= 5, // 64 bit complex
        CUDA_R_8I= 3,  // 8 bit real as a signed integer 
        CUDA_C_8I= 7,  // 8 bit complex as a pair of signed integers
        CUDA_R_8U= 8,  // 8 bit real as a signed integer 
        CUDA_C_8U= 9   // 8 bit complex as a pair of signed integers
} cudaDataType;

samnordmann · 2022-05-09T09:31:43Z

@samnordmann It might be better to decouple the formatting changes from the functional changes.

Way too many white-space changes. I had to hide whitespaces in order to make sense of the PR.

The file src/ucc/api/ucc.h is Whitespace-only changes.

The formatting changes were due to a mistake with automatic formatting. It is fixed now.
@jladd-mlnx @manjugv

src/core/ucc_dt.c

test/mpi/buffer.cc

test/gtest/common/test_ucc.h

test/gtest/common/test.h

Sergei-Lebedev · 2022-06-08T13:39:03Z

src/components/tl/sharp/tl_sharp_coll.c

not related to this PR, but there are SHARP_DTYPE_UINT8 and SHARP_DTYPE_UINT8 defined in sharp.h, should we update mapping? cc @bureddy

src/components/mc/cpu/mc_cpu.c

src/components/mc/cuda/kernel/mc_cuda_reduce_ops.h

test/mpi/buffer.cc

src/components/mc/base/ucc_mc_base.h

src/components/mc/cpu/mc_cpu.c

src/components/mc/cpu/reduce/mc_cpu_reduce_double_complex.c

src/components/mc/cuda/kernel/mc_cuda_reduce.cu

test/mpi/main.cc

test/gtest/coll/test_reduce.cc

test/mpi/main.cc

vspetrov · 2022-06-20T09:38:31Z

bot:retest

vspetrov

@samnordmann plz address final set of minor changes

src/components/mc/cuda/kernel/mc_cuda_reduce.cu

test/gtest/common/gtest.h

test/mpi/main.cc

* API: add float128 and float32(64,128)_complex dt * TEST: update mpi_tests with new dt * TEST: update Gtest with new dt * BUILD: check dt size during preprocessing

samnordmann force-pushed the complex_dt branch from af8e055 to 0452c87 Compare April 27, 2022 09:18

samnordmann added the WIP - Don't Merge label Apr 27, 2022

samnordmann changed the title ~~add support for float128 datype on CPU, all collectives, and reductio…~~ API: add float128 and float32(64,128)_complex dt Apr 27, 2022

samnordmann force-pushed the complex_dt branch 2 times, most recently from 128cba3 to 80f7bd6 Compare April 27, 2022 11:45

samnordmann requested review from Sergei-Lebedev, shimmybalsam and vspetrov April 27, 2022 11:49

samnordmann added Ready-for-Review and removed WIP - Don't Merge labels Apr 27, 2022

manjugv removed the Ready-for-Review label Apr 27, 2022

shimmybalsam reviewed May 2, 2022

View reviewed changes

jladd-mlnx suggested changes May 5, 2022

View reviewed changes

vspetrov mentioned this pull request May 5, 2022

Gtest update #498

Merged

samnordmann force-pushed the complex_dt branch 3 times, most recently from 8b42a71 to 20d60a8 Compare May 9, 2022 09:24

samnordmann force-pushed the complex_dt branch 2 times, most recently from cd992f2 to 9bed721 Compare May 9, 2022 09:44

samnordmann requested a review from shimmybalsam May 9, 2022 11:16

samnordmann force-pushed the complex_dt branch from 9bed721 to 1e9b514 Compare May 9, 2022 11:22

shimmybalsam approved these changes May 9, 2022

View reviewed changes

src/core/ucc_dt.c Outdated Show resolved Hide resolved

test/mpi/buffer.cc Outdated Show resolved Hide resolved

samnordmann force-pushed the complex_dt branch 5 times, most recently from db2ae78 to 3a5f501 Compare May 12, 2022 12:27

manjugv added the WIP - Don't Merge label May 25, 2022

samnordmann force-pushed the complex_dt branch from cb574e1 to 4108222 Compare June 7, 2022 18:03

samnordmann added the Ready-for-Review label Jun 7, 2022

samnordmann requested a review from vspetrov June 7, 2022 22:02

vspetrov removed the WIP - Don't Merge label Jun 8, 2022

Sergei-Lebedev reviewed Jun 8, 2022

View reviewed changes

samnordmann force-pushed the complex_dt branch from 4108222 to 069e7ca Compare June 13, 2022 10:24

samnordmann requested a review from Sergei-Lebedev June 13, 2022 13:39

vspetrov reviewed Jun 14, 2022

View reviewed changes

samnordmann force-pushed the complex_dt branch 2 times, most recently from 6ee4591 to 4b54615 Compare June 17, 2022 23:29

Sergei-Lebedev reviewed Jun 20, 2022

View reviewed changes

test/mpi/main.cc Outdated Show resolved Hide resolved

samnordmann force-pushed the complex_dt branch 5 times, most recently from 2cb6da8 to 50ae438 Compare June 20, 2022 13:16

samnordmann requested review from Sergei-Lebedev and vspetrov June 20, 2022 15:02

vspetrov approved these changes Jun 20, 2022

View reviewed changes

src/components/mc/cuda/kernel/mc_cuda_reduce.cu Outdated Show resolved Hide resolved

test/gtest/common/gtest.h Outdated Show resolved Hide resolved

test/mpi/main.cc Outdated Show resolved Hide resolved

Sergei-Lebedev approved these changes Jun 21, 2022

View reviewed changes

samnordmann added 2 commits June 21, 2022 10:17

API: add float128 and float32(64,128)_complex dt

8167137

TEST: update mpi_tests with new dt

2b18db6

samnordmann force-pushed the complex_dt branch from 50ae438 to c062a0d Compare June 21, 2022 07:25

samnordmann added 2 commits June 21, 2022 11:27

TEST: update Gtest with new dt

1254b9e

BUILD: check dt size during preprocessing

f94f11d

samnordmann force-pushed the complex_dt branch from c062a0d to f94f11d Compare June 21, 2022 08:28

vspetrov merged commit 8afd34a into openucx:master Jun 21, 2022

Conversation

samnordmann commented Apr 27, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Uh oh!

manjugv commented Apr 27, 2022

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jladd-mlnx left a comment

Choose a reason for hiding this comment

Uh oh!

samnordmann commented May 9, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Sergei-Lebedev Jun 8, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vspetrov commented Jun 20, 2022

Uh oh!

vspetrov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

samnordmann commented Apr 27, 2022 •

edited

Loading

samnordmann commented May 9, 2022 •

edited

Loading